NSF PAR Search | NSF Public Access Repository

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Wang, T; Gui, D; Hu, Y; Lin, S; Zhang, L (July 2025, Proceedings of Machine Learning Research)

Reinforcement Learning from Human Feedback (RLHF) has shown promise in aligning large language models (LLMs). Yet its reliance on a singular reward model often overlooks the diversity of human preferences. Recent approaches address this limitation by leveraging multi-dimensional feedback to fine-tune corresponding reward models and train LLMs using reinforcement learning. However, the process is costly and unstable, especially given the competing and heterogeneous nature of human preferences. In this paper, we propose Mixing Preference Optimization (MPO), a post-processing framework for aggregating single-objective policies as an alternative to both multi-objective RLHF (MORLHF) and MaxMin-RLHF. MPO avoids alignment from scratch. Instead, it log-linearly combines existing policies into a unified one with the weight of each policy computed via a batch stochastic mirror descent. Empirical results demonstrate that MPO achieves balanced performance across diverse preferences, outperforming or matching existing models with significantly reduced computational costs.
Free, publicly-accessible full text available July 17, 2026
FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees

Nie, F; Hou, X; Lin, S; Zou, J; Yao, H; Zhang, L (July 2025, Proceedings of Machine Learning Research)

The propensity of large language models (LLMs) to generate hallucinations and non-factual content undermines their reliability in high-stakes domains, where rigorous control over Type I errors (the conditional probability of incorrectly classifying hallucinations as truthful content) is essential. Despite its importance, formal verification of LLM factuality with such guarantees remains largely unexplored. In this paper, we introduce FACTTEST, a novel framework that statistically assesses whether an LLM can provide correct answers to given questions with high-probability correctness guarantees. We formulate hallucination detection as a hypothesis testing problem to enforce an upper bound of Type I errors at user-specified significance levels. Notably, we prove that FACTTEST also ensures strong Type II error control under mild conditions and can be extended to maintain its effectiveness when covariate shifts exist. FACTTEST is distribution-free and model-agnostic. It works for any number of human-annotated samples and applies to any black-box or white-box LM. Extensive experiments demonstrate that FACTTEST effectively detects hallucinations and enables LLMs to abstain from answering unknown questions, leading to an accuracy improvement of over 40%.
Free, publicly-accessible full text available July 17, 2026
Multi-DU fronthaul design leveraging power over fiber

Erbayat, E; Figueiredo, G; Lin, S-C; Matsuura, M; Hasegawa, H; Subramaniam, S (May 2025, IFIP)

Free, publicly-accessible full text available May 6, 2026
Power-over-fiber using a pure-silica inner-cladding double-clad fiber and 976 nm photovoltaic power converter for improving power transmission efficiency

Yaguchi, Y; Miyakawa, Y; Sugiura, S; Lin, S-C; Subramaniam, S; Hasegawa, H; Masson, D; Fafard, S; Matsuura, M (September 2024, European Conference on Optical Communications (ECOC))

Full Text Available
Non-Convex Bilevel Optimization with Time-Varying Objective Functions

Lin, S; Sow, D; Ji, K; Liang, Y; Shroff, N (February 2024, Conference on Neural Information Processing Systems)

Bilevel optimization has become a powerful tool in a wide variety of machine learning problems. However, the current nonconvex bilevel optimization considers an offline dataset and static functions, which may not work well in emerging online applications with streaming data and time-varying functions. In this work, we study online bilevel optimization (OBO) where the functions can be time-varying and the agent continuously updates the decisions with online streaming data. To deal with the function variations and the unavailability of the true hypergradients in OBO, we propose a single-loop online bilevel optimizer with window averaging (SOBOW), which updates the outer-level decision based on a window average of the most recent hypergradient estimations stored in the memory. Compared to existing algorithms, SOBOW is computationally efficient and does not need to know previous functions. To handle the unique technical difficulties rooted in single-loop update and function variations for OBO, we develop a novel analytical technique that disentangles the complex couplings between decision variables, and carefully controls the hypergradient estimation error. We show that SOBOW can achieve a sublinear bilevel local regret under mild conditions. Extensive experiments across multiple domains corroborate the effectiveness of SOBOW.
Full Text Available
Experimental demonstration of 128×128 optical cross-connects with 2.45 Pbps throughput

https://doi.org/10.1049/icp.2023.2536

Ochiai, T; Kuno, T; Munakata, R; Mori, Y; Lin, S-C; Matsuura, M; Subramaniam, S; Hasegawa, H (January 2024, European Conference on Optical Communication)

Full Text Available
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap

Wang, H.; Lin, S.; Zhang, J. (July 2023, Proc. International Conference on Machine Learning (ICML))

Full Text Available
Annealing reduces $\mathrm{Si}_3\mathrm{N}_4$ microwave-frequency dielectric loss in superconducting resonators

https://doi.org/10.1103/PhysRevApplied.21.054044

Mittal, S.; Adachi, K.; Frattini, NE; Urmey, MD; Lin, S-X; Emser, AL; Metzger, C.; Talamo, LG; Dickson, S.; Carlson, D.; et al (May 2024, Physical Review Applied)
Measurement of the free neutron lifetime in a magneto-gravitational trap with in situ detection

https://doi.org/10.1103/PhysRevC.111.045501

Musedinovic, R.; Blokland, L_S; Cude-Woods, C_B; Singh, M.; Blatnik, M_A; Callahan, N.; Choi, J_H; Clayton, S_M; Filippone, B_W; Fox, W_R; et al (April 2025, Physical Review C)
Endoscope Localization and Dense Surgical Scene Reconstruction for Stereo Endoscopy by Unsupervised Optical Flow and Kanade-Lucas-Tomasi Tracking

https://doi.org/10.1109/EMBC48229.2022.9871588

Yang Z, Lin S (September 2022, Annual International Conference of the IEEE Engineering in Medicine and Biology Society)

Full Text Available
